A Methodology of Constructing Canonical Form Database Schemas in a Multiple Heterogeneous Database Environment

Authors

  • Jeong Seok Lim
  • Dong-Guk Shin
Abstract

Databases are usually developed independently by each group or organization to meet their own individual requirements. However, as interactions between groups and organizations become increasingly common, many applications end up requiring data not from one single database but from several related databases. Thus, users have to deal with databases whose designs have never been coordinated. Interoperability between these multiple heterogeneous databases has been a hot research topic. Previously, in Lim et al. (1997), we proposed the Query Clearing House (QCH) model, which aims at assisting users in finding information in multiple heterogeneous databases. QCH plays the role of mediating between the end users' data needs and the local databases' data publication needs. The objective of this paper is to propose a uniform schema conversion method with which one can transform local relational database schemas into one cohesive format, i.e., into canonical form schema expressions. These canonical form expressions are then used as the basis for constructing two important resources for the QCH model: meta-data and mapping libraries. The meta-data includes not only the schematic information for the local databases, but also additional semantics that are not available in the database schemas themselves. The mapping library for each database has the mapping information between the general terms used in the meta-data and the specific terms used in the database schema so that a user's query can be systematically transformed into a database-specific query expression. We use L K, which was initially presented in Shin (1991, 1994), as the representation formalism for expressing canonical form schemas and the meta-data. L K is known for its flexible, descriptive power, which facilitates expressing general concepts at the desired level of granularity.
The language is also known for its versatile association mechanism, which …

The Query Clearing House (QCH) model aims at achieving an ideal heterogeneous multiple database environment in which users can submit queries without concern for the location of the data or the specifics of the relevant database schemas. One important prerequisite for building such an environment is to organize local database schemas in a cohesive way so that a systematic method of determining data relevancy can be developed. In this paper, we propose a method for converting multiple local relational schemas into canonical form expressions. The resulting canonical form schema expressions include much more descriptive data semantics than what is offered by the original database schemas. These expressions also include information for mapping between …
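The role the abstract assigns to a mapping library, translating general meta-data terms into the specific terms of one local schema, can be illustrated with a minimal Python sketch. It is not the authors' L K formalism or their conversion method: the database names, table and column names, the `TermMapping` class, and the `to_local_query` function are all hypothetical, and only the single-table case is handled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TermMapping:
    """Maps one general (meta-data) term to a table and column in a local schema."""
    table: str
    column: str


# Hypothetical mapping libraries for two local databases.
# Every database, table, and column name here is invented for illustration.
MAPPING_LIBRARIES = {
    "hr_db": {
        "employee name": TermMapping("EMP", "ENAME"),
        "salary": TermMapping("EMP", "SAL"),
    },
    "payroll_db": {
        "employee name": TermMapping("staff", "full_name"),
        "salary": TermMapping("staff", "annual_pay"),
    },
}


def to_local_query(database: str, general_terms: list[str]) -> str:
    """Rewrite a list of general terms as a SELECT over one local schema.

    Raises KeyError for an unknown database or unmapped term, and
    ValueError unless all terms map to exactly one table.
    """
    library = MAPPING_LIBRARIES[database]
    mappings = [library[term] for term in general_terms]
    tables = {m.table for m in mappings}
    if len(tables) != 1:
        raise ValueError("this sketch only handles terms from a single table")
    columns = ", ".join(m.column for m in mappings)
    return f"SELECT {columns} FROM {tables.pop()}"
```

Under these assumptions, the same request expressed in general terms, `["employee name", "salary"]`, yields `SELECT ENAME, SAL FROM EMP` for `hr_db` and `SELECT full_name, annual_pay FROM staff` for `payroll_db`, so the user never needs to know either local schema.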

Similar resources

An Object Query Language for Multimedia Federations

The Físchlár system provides a large centralised repository of multimedia files. As expansion is difficult in centralised systems and as different user groups have a requirement to define their own schemas, the EGTV (Efficient Global Transactions for Video) project was established to examine how the distribution of this database could be managed. The federated database approach is advocated whe...

Integrating relational and object-oriented database systems using a metaclass concept

This paper presents a specific approach to integrating a relational database system into a federated database system. The underlying database integration process consists of three steps: first, the external database systems have to be connected to the integrated database system environment and the external data models have to be mapped into a canonical data model. This step is often called synta...

Constructing the Bayesian network structure from dependencies implied in multiple relational schemas

Relational models are the most common representation of structured data, and acyclic database theory is important in relational databases. In this paper, we propose a method for constructing the Bayesian network structure from dependencies implied in multiple relational schemas. Based on the acyclic database theory and its relationships with probabilistic networks, we are to construct the Bay...

Schema Management for Document Stores

Document stores that provide the efficiency of a schema-less interface are widely used by developers in mobile and cloud applications. However, the simplicity developers achieved controversially leads to complexity for data management due to lack of a schema. In this paper, we present a schema management framework for document stores. This framework discovers and persists schemas of JSON record...

Building Parameterized Canonical Representations to Achieve Interoperability among Heterogeneous Databases

This paper describes a technique to support interoperable query processing when multiple heterogeneous databases are accessed. We focus on the problem of supporting query transformation transparently, so a user can pose queries locally, without any need of global knowledge about different data models and schemas. To support interoperable query transformation, we need to resolve the conflicts (i....

Journal title:
  • J. Database Manag.

Volume 9

Publication date: 1998